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Abstract — Self-Repairing Codes (SRC) are codes designed to 
suit the need of coding for distributed networlted storage: tliey 
not only allow stored data to be recovered even in the presence 
of node failures, they also provide a repair mechanism where as 
little as two live nodes can be contacted to regenerate the data 
of a failed node. In this paper, we propose a new instance of 
self-repairing codes, based on constructions of spreads coming 
from projective geometry. We study some of their properties 
to demonstrate the suitability of these codes for distributed 
networked storage. 

Index Terms — self-repair, projective geometry, coding, dis- 
tributed storage 

I. Introduction 

Storing digital data is a basic necessity of modern societies. 
The volume of data to be stored is tremendous, and is rapidly 
increasing. The kinds of data vary widely - from corporate 
and financial data repositories, archive of electronic commu- 
nications to personal pictures, videos and work documents 
stored and shared in Web 2.0 and cloud based services, and 
much more. Distribution of such huge amount of data over 
multiple networked storage devices is thus the only practical 
and scalable solution. 

All across the wide gamut of networked distributed storage 
systems design space, eventual failure of any and all indi- 
vidual storage devices is a given. Consequently, storing data 
redundantly is essential for fault tolerance. Furthermore, over 
a period of time, due to failures or departure of storage devices 
from the system, the redundancy will gradually decrease - 
risking the loss of the stored data, unless the redundancy 
is recreated. The possible ways for recreating redundancy 
depends on, to start with, the kind of redundancy being used. 

Data redundancy can be achieved using replication - how- 
ever that entails a very large storage overhead. Erasure coding 
based strategies in contrast can provide very good amount of 
redundancy for a very low storage overhead. However, when 
an encoded data block is lost and needs to be recreated, for 
traditional erasure codes, one would first need data equivalent 
in amount to recreate the whole object in one place (either 
by storing a full copy of the data, or else by downloading 
adequate encoded blocks), even in order to recreate a single 
encoded block. Such drawback of traditional erasure codes has 
in recent years given rise to a new flavor of coding research: 



designing erasure codes which need much less information to 
carry out the recreation of a lost encoded block. 

A. Related Work 

More precisely, consider a network of n storage nodes, each 
with storage capacity a, where an object o of size B has to 
be stored. A source possibly processes (encodes) the object 
o, splits it into n blocks, each of size at most a, and stores 
such blocks at n storage nodes. When a data collector wants 
to retrieve the object, he should be able to do so by contacting 
a subset of live nodes. We define k as the minimum number of 
nodes that need to be contacted to retrieve the object, where 
the data collector may download upto ka amount of data, and 
possibly process (decode) the downloaded data. For maximal 
distance separable (MDS) erasure codes, any arbitrary subset 
of k nodes allow data retrievability. New nodes joining the 
network are assumed to perform the repair by contacting d 
live nodes, from each of which they download /? amount of 
data. 

There are arguably two extreme points possible in the 
design-space of codes for distributed networked storage: 

(i) Minimize the absolute amount of data transfer d/3 needed 
to recreate the lost data from one node. Network-coding 
inspired analysis determines the storage-bandwidth (per repair) 
trade-offs, and a new family of codes called regenerating codes 
(RGC) 121, is) have been proposed, which can achieve (some 
points on) such a trade-off curve, under the assumption that 
d > k. Regenerating codes, like MDS erasure codes, allow 
data retrievability from any arbitrary set of k nodes. 

(ii) Minimize the number of nodes to be contacted for 
repairing one node failure. Recently proposed self-repairing 
codes achieve this optimal, by allowing one repair while 
contacting only two nodes, i.e. d ^ 2. More specifically, self- 
repairing codes satisfy two cardinal properties, namely: (a) 
repairs can be performed directly through other subsets of 
nodes, without having to download data equivalent to that 
needed to reconstruct first the original object, ensuring that 
(b) a block is repaired from a fixed number of blocks, the 
number depending only on how many blocks are missing and 
independent of which specific blocks are missing. Note that 
minimization of the number of contacted nodes for a repair is 
achieved when the fixed number in clause (b) is in fact two. 



Homomorphic self-repairing codes (HSRC) were proposed 
in f4l, which, besides satisfying the cardinal properties elab- 
orated above, were shown to (i) have {n — l)/2 distinct 
pairs with which the data for a single missing node could 
be regenerated, and consequently, (ii) self-repair for up to 
(n — l)/2 node failures could be carried out simultaneously 
using two nodes for each self-repair, from the pool of the 
remaining (n + l)/2 live nodes. 

B. Contributions 

This paper proposes a new family of self-repairing codes 
(PSRC) derived from a projective geometric construction. 
Besides the fundamental difference in its construction, and 
apart from satisfying the cardinal properties of self-repairing 
codes, as well as, in fact the other properties satisfied by 
HSRC, PSRC has several other salient features, as summarized 
next: 

(i) Both the encoding and self-repair processes for PSRC 
involve only XOR operations, unlike HSRC encoding which 
involved the relatively more expensive task of evaluating a 
polynomial. 

(ii) Similar to regenerating codes p], in PSRC, each en- 
coded block (i.e., data stored by a node) comprise of several 
(a) pieces. Regeneration of the whole encoded block thus can 
likewise be done by regenerating the individual constituent 
pieces. This is in contrast to HSRC, where the encoded blocks 
were 'atomic', and hence repair of the whole encoded block 
had to be carried out atomically. This gives PSRC some of 
the advantages of regenerating codes, while also naturally 
retaining the advantages of self-repairing codes, and provides 
several additional desirable properties, as elaborated next. 

(iii) For self-repair of a specific node, if one live node 
is chosen arbitrarily, then there are several other nodes with 
which the first chosen node can be paired to regenerate the 
lost encoded block. This is in contrast to HSRC, where there 
is a unique pairing for one lost node, once one live node is 
chosen. 

(iv) While the resulting code is strictly speaking not system- 
atic in terms of what is stored at each node, if the constituent 
pieces stored over the nodes are considered, then systematic 
reconstruction of the object is possible, though this will need 
communication with ak << n specific nodes. 

II. Background from Projective Geometry 

The proposed construction as described in next section relies 
on the notion of spread coming from projective geometry. We 
thus start by providing the required background. 

Consider the finite field ¥q, where q is a power of a prime 
p, and a vector space of dimension m over F,, namely, a 
projective space denoted PG{m — 1,(7). Note that we will 
adopt a row vector convention for the rest of the paper 

Definition 1: Let V he a projective space. A t-spread of V 
is a set S of t-dimensional subspaces of V which partitions V. 
That is, every point of V is contained in exactly one t-space 
of S. 



If 7'=PG(m— 1, q) is a finite projective space, then a t-spread 
can only exist if the number of points of a t-space divides the 
number of points of the whole space, i.e., if ''q-i 
and hence (q*^^ — — 1), which holds if and only if 

(t + 1)1 m. Andre iQ] showed that this necessary condition is 
also sufficient. 

Theorem 1: fSj In PG(to — 1,(7), a i-spread exists if and 
only if t + 1| m. 

A systematic construction of spreads can be obtained 
through field extensions as follows. Suppose that < + 1| m. 
Consider the finite fields Fq = F,, Fi = F^t+i and F2 — F^™. 
Then Fi^ Fi <Z F2- The field F^ is an m-dimensional vector 
space V over Fq. The subspaces of V form the projective space 
7-'=PG(to, (7). The field Fi is a + l)-dimensional subspace 
of V and hence a ^-dimensional (projective) subspace of V. 
The same holds for all cosets aFi, (a G ^2)- These cosets 
partition the multiplicative group of F2. Hence they form a 
t-spread of V. 

Example 1: Take as base field Fq = F2, i.e., the alphabet 
is {0, 1}. In order to obtain planes, we consider 1-spread, i.e., 
t = 1 and hence Fi = F4. Finally, assume m — 4, that is 

F2 = Fie: 

F2 = F16 

I ~ 

Fi =F4 
I 2 

Fo =F2 

Denote by F* the multiplicative group of F^. Recall that F* 
is a cyclic group. Let oj and ly be the respective generators of 
F2* and F*. We have that f is an element of order 3 contained 
in F2, so = Lo^. Thus F* can be written Fj* — {1, w'', w^"}. 
As F2* can be written 

i=l 

we have a partition of Fig into cosets of the form w'F^, i ~ 
1, . . . , 5. These five cosets define five disjoint planes. More 
precisely. Fig can be decomposed into direct sums of F2: 

F16 = F4 © I/F4 = F2 © i'¥2 © a;F2WJ^F2, 

so that each element of Fig can be written as a 4-tuple. For 
example, the coset a;F| contains the elements uj,uji',uji''^. 
As i/^ = v + 1, Lov^ is the sum of the two other points. 
Thus writing uj = (0,0,1,0) and utf — (0,0,0,1), we 
finally get that the plane defined by the coset a;F| is 
{(0010), (0001), (0011)}. 

III. Code Construction 

Recall that our goal is to encode an object of size B to 
be stored over n nodes, each of storage capacity a, such that 
each failure can be repaired by contacting any d live nodes, 
d > 2. We denote by PSRC{n, k) the self-repairing code 
with parameters n and k obtained from a spread construction. 



Ni to Nr 


Ns to Afi4 


Nis to N21 


(100000),(110111) 

/A 1 nr\rvA\ / i a i a i i \ 

(U1UUUU),(1U1U1 1) 
(001000),(100101) 
(000100),(100010) 
(000010),(010001) 
(000001),(1 11000) 
(110000),(011100) 


(011000),(001110) 
(UUl 1UU),(0001 11) 
(0001 10),(1 10011) 
(00001 1),(101001) 
(110001),(100100) 
(101000),(010010) 
(010100),(001001) 


(001010),(1 10100) 

/AAA 1 A 1 \ /A 1 1 A 1 A\ 

(UUU1U1),(01 lulu) 
(110010),(001101) 
(011001),(110110) 
(111100),(011011) 
(011110),(111101) 
(001111),(101110) 



TABLE I 

Basis vectors for the scenario where we have B = 6, a = 2, 
n = 1 + 2^ + {2^)2 = 21 NODES Afi,. . .,N2i. 



We will assume for simplicity that we work over the 
base field F2, though spreads can be constructed over larger 
alphabets. 

A. Setting the Parameters and Encoding 

1) We first set m = B,so that we are working with elements 
in F2 = FgB, that is B-dimensional vectors over F2. 

2) Consider a f-spread S formed of ^-dimensional subspaces 
of V such that t + 1\B. In particular, take Fi = F^t+i. Since 
every subspace is a (t+l)-dimensional vector space over F2, it 
is described by a F2-basis containing (t + 1) vectors. We thus 
set t + 1 = a, and assign to each node an F2 -basis containing 
a vectors. The number of nodes that will store the object is 
consequently (at most) 

2^-1 

n = . 

2" - 1 

Since we must take a\B, that is B = ha, we can further write 

"=|!73r = l + 2" + (2")2 + ... + (2")^-i. (1) 

3) Let us denote by Vi the collection of all na vectors, 
ordered such that t;i , . . . , w^t correspond to the first node, 
Va+i, ■ ■ ■ , V2a to the sccond node, etc. What the ith node will 
store is actually 

for a total storage of a. 

Example 2: Consider the partition described in Example [T] 
where we recall that = v + \, ~ 15, v^^ — \ lS^ — 
UJ + l, \Vl\ = 3, a;3 = I, u = ^ v"^ + v. 

The final partition of the space is thus: 

F: = {(1000), (0110), (1110)} 
u¥l = {(0100), (0011), (0111)} 
v'^Vl ^ {(0010), (1101), (1111)} 

^^f:; = {(0001), (1010), (1011)} 

v^¥l ^ {(1100), (0101), (1001)} 

This corresponds to the code parameters B — A, a = 
2, n 1 + 2^ = 5 from Let us denote by N^, i = 1, . . . , 5 
the 5 storing nodes, with storage capacity a ~ 2, and by 
o = (01,02,03,04) the object to be stored. For example, we 



tS — OCX. 


ex. 


„ 1 1 OQ 1 / na\b — 1 

Ti — l + z H-...(_z ) 
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2 


5 


6 
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21 


6 


3 


9 


8 


2 


85 


8 


4 


17 



TABLE II 

Set of some small available parameters for PSRC{n, k). 



can use the basis vectors as follows: 



node 




basis vectors 


data stored 


Ni 


Vl 


= (1000), V2 = 


(0110) 


{01,02 + 03} 


N2 


V3 


= (0100), V4 = 


(0011) 


{02, 03 + 04} 


N3 


V5 


= (0010), V6 = 


(1101) 


{03, 01 + 02 + 04} 


N4 


V7 


= (0001), vs - 


(1010) 


{04, 01 + 03} 


N5 


V9 


= (1100), vw = 


(0101) 


{oi +02,02 + 04} 



Furthermore, the first available parameters are summarized 
in Table UlLAl 

B. Repair 

We now need to make sure that the above coding strategy 
allows for object retrieval and repair We start with repair of 
data stored in one storage node. It was shown in [4 1 for HSRC 
that it is possible to repair data for one node by contacting 
d = 2 nodes, and there are {n — l)/2 such choices of 2 nodes 
that allow repair This holds also for PSRC. 

Lemma 1: Suppose we have n nodes, each storing a pieces 
of data encoding an object using PSRC{n, k). Then if one 
node Ni fails, it is possible to repair it by contacting d = 2 
nodes. More precisely, for any choice of node Ni among the 
remaining n — 1 live nodes, there exists at least one node Nj 
such that TV; can be repaired by downloading the data stored 
at nodes Ni and Nj. 

Proof: The ^th node Ni stores a subspace of the form 
j^'Fjc I — 1, . . . ,n. Let us assume this kh node fails, and a 
new comer joins. It contacts any node, say Ni. Since Ni stores 
j^^Fjc we need to show that there exists a node A'^ such that 

,.'¥*2^l[,.'¥*2^ 

repairs iV;. Now 

(j/* + j/')F;„ C iy'¥*2a.Y[iy^¥*2o, 

so we can take j such that i^^ = jy^ + By combining the 
data stored at node Ni and Nj, we thus get 

i^^F*„ Hiiy^ + ,y')¥*2^ 

which contains i^'Fja. ■ 
Example 3: Let us continue with Example |2] If say A^i 
fails, the data pieces oi (corresponding to the basis vector 
(1000)) and 02 + 02 (corresponding to the basis vector (0110)) 
are lost. A new node joining the network can contact nodes 
A^3 and A^4, from which it gets respectively W5 = (0010), 
ve = (1101) and vj = (0001), vs = (1010). Now + W5 
gives (1000) while vs + (we + vi) gives (0110). 



Actually, in general, the redundancy for self -repair provided 
by PSRC is even stronger than that of HSRC, as we now 
illustrate. 

Lemma 2: Suppose we have n ~ 21 nodes, each storing 
a = 2 pieces of data, encoding an object of size B ~ Q using 
PS'i?C(21, 3), as summarized in Table |2] Then if one node 
Ni fails, for any choice of node A^; among the remaining 20 
live nodes, there exists three nodes Nj^, Nj^, Nj,^ such that 
Ni can be repaired by downloading the data stored at either 
nodes Ni and Nj-^ , or Ni and Nj^ , or even Ni and Nj,^ . 

Proof: Recall that oj is the generator of the cyclic group 
F4. We have that node Ni stores i^'FJ, and Ni similarly stores 
j/T*. Now 

{l/^ + v\ V^UJ + I^'w, V"^ + V''ijj + + V^ijJ 

+ v^LO, + + v\ i/'^uj + i/' + i/'w 

I^^OJ + I'^iy'^ + I/V + I^'-Ld, + + U^Oj} = 

Take ji, j2, js such that 

j/J'i = i^^ + J/', ly^^ =iy' + f^Lo, v^'' ^v' + u'oj. 
We have then 

(iV^iVj,) ^ j.^F2[](z.' + z.'t^)F:Dj.'F:, 

■ 

This proof actually gives an algorithm to find the different 
pairs that repair a given failed node. 

Example 4: Consider the code described in Table |2l and 
suppose that the node Ni fails, and a new comer contacts 
node A^4 which stores i/^¥l. We have 

l^^+v' = 1 + 1/3 = =^Ni2 
V^LU + V^ = uj + v-^ ^ v'^^v^ ^ Nw 
+ v'uj = 1 + V^UJ ^u'^ N5. 

Thus the node A^i can be repaired by contacting the following 
three pairs all involving A'4: 

(7V4,A^12), (iV4,^10), {N4,N5). 

C. Object Retrieval 

If a data collector connects to any choice of k nodes, then 
he can access upto ka blocks, while trying to reconstruct an 
object of size B. Thus, k > B/a. Note that in the examples 
considered in this paper, k — B/a. 

Lemma 3: If fc = 2, then the object can be retrieved from 
any choice of A; = 2 nodes, in which case, we may see 
PSRC{n, k) as a MDS code. 

Proof: If fc 2, then each node stores a = B/2 linearly 
independent vectors. Pick any two nodes say N (containing 



vi, . . . , Va) and N' (similarly storing ui, . . . , Ua)- Suppose 
that there exists a vector u in which is linearly dependent 
of some vectors in N': 

a a 

V = aiVi + bjUj. 

Since v G N and J2"=i '^(^i'^i ^ it must be that 
X]j=i ^j'^^j & N, a contradiction since N and N' are non- 
intersecting by the definition of spread. ■ 

To recover the object, the data collector just solves the 
system of linear equations in o. 

In general, when fc > 3, SRC codes are not maximum 
distance separable (MDS). A static resilience analysis provides 
an estimate of how much deterioration the system may suffer 
due to the lack of the maximum distance separability. 

Static resilience of a distributed storage system is defined as 
the probability that an object, once stored in the system, will 
continue to stay available without any further maintenance, 
even when a certain fraction of individual member nodes of 
the distributed system become unavailable. Let Pnode be the 
probability that any specific node is available. Then, under 
the assumptions that node availability is i.i.d, and no two 
fragments of the same object are placed on any same node, we 
can consider that the availability of any fragment is also i.i.d 
with probabihty pnode. The probabihty pobj of recovering the 
object is then 

n 

Pobj — PxCxPnodei^ — Pnode)"' ^, 

x — k 

where is the conditional probability that the stored object 
can be retrieved by contacting an arbitrary x out of the n 
storage nodes. 

For (n, fc) MDS erasure codes, px is a deterministic and 
binary value equal to one for x > k, and zero for smaller x. For 
self -repairing codes, the value is probabilistic. In Fig. |l(a)| we 
show for our toy example PSRC{21,3) the probability that 
the object cannot be retrieved, i.e., 1 — p^, where the values 
of Px for X > k were determined by exhaustive searchQ 

In particular, one can list 17 unique groups of 5 nodes, 
whose all together 10 basis vectors generate a matrix with 
rank less than 6, out of the (^^) = 20349 unique groups of 5. 
This means that if we choose any 5 arbitrary nodes, the object 
still cannot be retrieved with a probability of 0.00083, which is 
rather negligible. Similarly, if we chose any arbitrary 3 nodes, 
the probability of unretrievability is 0.150375. In contrast, 
for MDS codes, the object will be retrievable from the data 
available at any arbitrary three nodes. Of-course, this rather 
marginal sacrifice provides PSRC an incredible amount of self- 
repairing capability. For any one node lost, as shown earlier 
in Lemma 12] one can choose any of the twenty remaining live 
nodes, and pair it with three other nodes, and regenerate the 
lost data. 

In Fig. |l(b)| we compare the static resilience pobj for 
PSRC{21,3) with respect to what could be achieved using 

^ Px is zero for x < k for PSRC also. 
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Fig. 1. Results for PSRC(21,3) 



a MDS £'C(21,3). The values were determined numerically, 
using the px values evaluated as mentioned above. We note 
that in practice a MDS erasure code may or not exist with 
the specific (n, k) parameters. More importantly, we notice 
that the degradation of static resilience of PS* i? (7 (21, 3) to 
achieve the self-repairing property is marginal with respect to 
that of a MDS erasure code, if such a code were to/does exist. 

IV. Further discussions 

We point out a few more properties of the proposed codes. 

Systematic Like Code: It is usually appreciated from an 
implementation perspective to use a systematic code, since 
it makes the object retrieval immediate. We notice that though 
our code is not systematic, we can however contact B specific 
nodes (instead of fc), namely those storing as pieces each of 
the canonical basis vectors of F2i3 to reconstruct the object in 
a systematic manner 

Bandwidth cost for regeneration: Unlike HSRC, the PSRC 
encoded blocks are not atomic, and instead comprise of a 
pieces. Thus, similar to regenerating codes, one could also 
expect to regenerate an encoded block piece-by-piece, by 
contacting more (larger d) number of nodes. For example, 
when using PS" i?C (21, 3), if the data for node Ni needs 
to be regenerated, one could do so by contacting two nodes 
and downloading four pieces (units) of data, as we have 
already seen. One could instead also contact d = 3 nodes, 
and regenerate the two lost pieces by downloading only three 
units of data. For instance, by downloading (010000) from iV2, 
(110000) from Nj and (000111) from TVg. 

As noted previously, for our examples, a = B/k, corre- 
sponding to what is known as the Minimum Storage Regen- 
eration (MSR) point for regenerating codes. At MSR point, a 
node needs to contact d > k nodes, and download , ^, , , , 
data from each, resulting in a total download of 
data. Thus, for the same choices of a, B, k and with d — 3, 
one would need to download 6 units of data, and for d = 4, 
one would need to download 4 units of data, while d = 2 
is not allowed. Thus, for the regeneration of one lost node. 



PSRC can outperform regenerating codes both in terms of 
absolute bandwidth needed, as well as the number of nodes 
needed to carry out such regeneration, moreover, for upto 
(n — l)/2 failures, the regeneration overhead per node's data 
stays constant for PSRC. It of-course needs to be noted that, 
in order to achieve these very interesting performance, we 
sacrificed the MDS property. In practice, this sacrifice however 
has marginal impact, as can be observed from the resulting 
codes' static resilience. 

V. Concluding remarks 

In this work, we showed the existence of another instance of 
self-repairing codes, which are codes tailor made to meet the 
peculiarities of distributed networked storage. The proposed 
code family in this paper is based on constructions of spreads 
from projective geometry. We provided a preliminary study 
of the properties of this new family, demonstrating that they 
outperform existing code famiUes both in several quantitative 
as well as qualitative metrics. Further analysis to comprehend 
and harness these codes in practical settings are currently 
under investigation. 
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